Statistical analysis of big data on pharmacogenomics.
Abstract
This paper discusses statistical methods for estimating complex correlation structure from large pharmacogenomic datasets. We selectively review several prominent statistical methods: estimation of large covariance matrices for understanding correlation structure; estimation of inverse covariance matrices for network modeling; large-scale simultaneous testing for selecting significantly differentially expressed genes and proteins, and genetic markers for complex diseases; and high-dimensional variable selection for identifying the molecules that are important for understanding molecular mechanisms in pharmacogenomics. Applications to gene network estimation and biomarker selection illustrate the power of these methods. Several new challenges of big data analysis are also discussed, including complex data distributions, missing data, measurement error, spurious correlation, endogeneity, and the need for robust statistical methods.
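To make the idea of large-scale simultaneous testing concrete, the following is a minimal sketch of the Benjamini-Hochberg step-up procedure, one widely used way to control the false discovery rate when many genes are tested at once. The abstract does not say which testing procedures the paper reviews, so the choice of this procedure is an assumption, and the function name `benjamini_hochberg` and the p-values are hypothetical and purely illustrative.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return the indices of hypotheses rejected by the
    Benjamini-Hochberg step-up rule at FDR level alpha."""
    m = len(p_values)
    # Sort hypothesis indices by ascending p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest 1-based rank k with p_(k) <= k * alpha / m.
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank * alpha / m:
            k_max = rank
    # Reject the k_max hypotheses with the smallest p-values.
    return sorted(order[:k_max])


# Hypothetical p-values for six genes (illustrative, not from the paper).
p = [0.001, 0.008, 0.039, 0.041, 0.20, 0.74]
selected = benjamini_hochberg(p, alpha=0.05)
print(selected)
```

With these made-up inputs only the two smallest p-values fall below their rank-dependent thresholds, so only the first two genes are selected, even though four p-values are below 0.05 individually.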
Related articles
Design and Test of the Real-time Text mining dashboard for Twitter
One of today's major research trends in the field of information systems is the discovery of implicit knowledge hidden in datasets that are currently being produced at high speed, in large volumes, and in a wide variety of formats. Data with such features is called big data. Extracting, processing, and visualizing this huge amount of data has become one of the concerns of data science scholar...
Correlation of Big Data with Supply Chain Health Performance in Employees of the Tehran Intelligent Fuel System
Introduction: The dramatic growth of big data and its application in preventing waste of resources and increasing financial performance and supply chain health levels, need to be examined from different perspectives. This study aimed to determine the correlation between big data and supply chain health performance in employees of Tehran Intelligent Fuel System. Methods: In this descriptive cor...
2016 Olympic Games on Twitter: Sentiment Analysis of Sports Fans Tweets using Big Data Framework
Big data analytics is one of the most important subjects in computer science. Today, due to the increasing expansion of Web technology, a large amount of data is available to researchers. Extracting information from these data is one of the requirements for many organizations and business centers. In recent years, the massive amount of Twitter's social networking data has become a platform for ...
Nonparametric variable selection and dimension reduction methods and their applications in pharmacogenomics
Zhu, Jingyi Ph.D., Purdue University, December 2014. Nonparametric Variable Selection and Dimension Reduction Methods and Their Applications in Pharmacogenomics. Major Professor: Jun Xie. Nowadays it is common to collect large volumes of data in many fields with an extensive amount of variables, but often a small or moderate number of samples. For example, in the analysis of genomic data, the ...
A Conceptual model on the pharmacogenomics implementation complications: An applied qualitative research based on national drug policy components
Introduction: Pharmacogenomics may well have substantial effects on the clinical, economic and regulatory aspects of the health sector, which can lead to complications in access. Therefore, there is a need for evidence-based frameworks based on national drug policy components. The objective of the current study is to identify pharmacogenomics-based complications and develop a conceptual model. Mate...
Journal title:
- Advanced Drug Delivery Reviews
Volume 65, Issue 7
Pages: -
Publication date: 2013